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1. Introduction 

Let (Xj,Yj), l<i<N,beN>l independent copies from a couple (X, Y) 
of independent positive random variables (rv’s) defined over some probability space 
(f2, A, P) , with continuous marginal distribution functions (df’s) F and G respectively. 
Suppose that X is right-truncated by Y, in the sense that X,; is only observed when 
Xj < Yj. We assume that both survival functions F := 1 — F and G := 1 — G are 
regularly varying at infinity with respective indices — 1/71 and — 1 / 72 . That is, for 
any s > 0 

lim = s and lim = s ~Yi^ (Li) 

x —^00 p i^c) y—too G(y) 

Being characterized by their heavy tails, these distributions play a prominent role 
in extreme value theory. They include distributions such as Pareto, Burr, Frechet, 
stable and log-gamma, known to be appropriate models for fitting large insurance 
claims, log-returns, large fluctuations, etc... (see, e.g., Resnick, 2006). The trunca¬ 
tion phenomenon may occur in many fields, for instance, in insurance it is usual that 
the insurer’s claim data do not correspond to the underlying losses, because they 
are truncated from above. Indeed, when dealing with large claims, the insurance 
company stipulates an upper limit to the amounts to be paid out. The excesses over 
this fixed threshold are covered by a reinsurance company. This kind of reinsurance 
is called excess-loss reinsurance (see, e.g., Rolski et ah, 1999). Depending on the 
branches of insurance, the upper limit, which may be random, is called in different 
ways: in life insurance, it is called the cedent’s company retention level whereas 
in non-life insurance, it is called the deductible. For a recent paper on randomly 
right-truncated insurance claims, one refers to Escudero and Ortega ( 2008). 

Let us now denote (A 7 ,, Yj) , i = 1,..., n, to be the observed data, as copies of a couple 
of rv’s (A, Y) with joint df H , corresponding to the truncated sample (Xj,Yj), 
i — 1, ..., IV, where n = is a sequence of discrete rv’s. By the law of the large 

p 

numbers, we have njy/N —> p := P (X < Y), as N —> 00 . For convenience, we use, 

p 

throughout the paper, the notation n —> 00 to say that n —> 00 . For x,y > 0, we 
have 

H{x,y) :=P (X<x,Y<y) 

= P (X < x, Y < y | X < Y) = p- 1 [ F (min (y, z )) clG (z). 

Jo 

Note that, conditionally on n, the observed data are still independent. The marginal 
distributions of the observed X's and Y's, respectively denoted by F and G , are 
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equal to 


F (x) = p- 1 / G {z) dF (z) and G (y) = p" 1 / F (z) dG (z ), 


it follows that the corresponding tails 


POO POO 

F (x) = -p - 1 / G (z) dF (z) and G (y) = -p ' 1 / F (z) dG (z). 

J x J y 

It is clear that the asymptotic behavior of F simultaneously depends on G and F 
while that of G only relies on G. Making use of Potter’s bound inequalities (see 
Lemma 6 . 3 ), for the regularly varying functions F and G, we may readily show that 
both G and F are regularly varying at infinity as well, with respective indices 72 
and 7 := 7172/ (71 + 72) • That is, we have, for any s > 0 , 

lim = s~ lh and lim = s' 1/72 . (1.2) 

F ( x) y^°° G (y) 

Recently Gardes and Stupfler ( 2014 ) addressed the estimation of the extreme value 
index 71 under random truncation. They used the definition of 7 to derive the 
following consistent estimator: 


71 (k, k') := 


7 (fc) 72 (fc') 

72 (k 1 ) ~ 7 (k) ’ 


where 

k k' 

7 (k) lQ g A y~ ,+1:W and ^2 (k') log w~ t+1:n > ( L3 ) 

™ -A-n—k-.n & ■ -t Xn—k':n 

i=l i=l 

are the well-known Hill estimators of 7 and 72, with X 1:n < ... < X n:n and F 1:n < 
• •• < Y n -.n being the order statistics pertaining to the samples (X\, ...,X n ) and 
(Y 1 ,...,Y n ) respectively. The two sequences k = k n and k! = k! n of integer rv’s, 
which satisfy 


1 < k, k! < n, min (k, k') —> 00 and max (k/n, k!/n) —> 0 as n —> 00, 


respectively represent the numbers of top observations from truncated and trunca¬ 
tion data. By considering the two situations kfk! —> 0 and k'/k —> 0 as n —» 00, the 
authors established the asymptotic normality of 71 (k, k '), but when k/k’ —> 1, they 
only showed that 

\J min (k, k') (71 (k, k!) — 71) = O p (1), as n —» 00, 

which is not enough to construct conhdence intervals for 7^ It is obvious that an 
accurate computation of the estimate 71 (k, k r ) requires good choices of both k and 
k'. However from a practical point of view, it is rather unusual in extreme value 
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analysis to handle two distinct sample fractions simultaneously, which is mentioned 
by Gardes and Stupfler (2014) in their conclusion as well. For this reason, we con¬ 
sider, in the present work, the situation when k — k' rather than k/k' —> 1. Thus, 
we obtain an estimator 


7i : = 7i (k) 


k Y 

\ ^ -A-n—i-\-l:n 

log 


„-l i=l 


X n _ 


n—k:n 


^ log 


i =1 


z+l:n 

V 

1 n—k:n 


J^iog 


2=1 


A n-fc:n J n—z+l:n 

V V 
1 n—k-.n^-n—i+l:n 


(1.4) 


of simpler form, expressed in terms of a single sample fraction k of truncated and 
truncation observations. The number of extreme values used to compute the optimal 
estimate value 71 may be obtained by applying one of the various heuristic methods 
available in the literature such that, for instance, the algorithm of page 137 in 
Reiss and Thomas (2007), which will be applied in Section 3. 


The task of establishing the asymptotic normality of 71 is a bit delicate as one 
has to take into account the dependence structure of X and Y. The authors of 
Gardes and Stupfler (2014) avoided this issue by putting conditions on the sample 
fractions k and k'. In our case we require that the joint df H have a stable tail 
dependence function £ (see Huang, 1992 and Drees and Huang, 1998), in the sense 
that the following limit exists: 

lim t~ 1 P (F (X) <txovG (Y) < ty) =: £ (x, y) , (1.5) 

for all x, y > 0 such that max (x, y ) > 0. Note that the corresponding tail copula 
function is defined by 

lim r'P (F (X) < tx, G (Y) < ty) =: R (x, y) , (1.6) 

which equals x + y — £ (x, y). In on other words, we assume that H belongs to the 
domain of attraction of a bivariate extreme value distribution. This may be split 
into two sets of conditions, namely conditions for the convergence of the marginal 
distributions (1.2) and others for the convergence of the dependence structure (1.5) . 
For details on this topic, see for instance Section 6.1.2 of de Haan and Ferreira (2006) 
and the papers of Huang (1992), Schmidt and Stadtmiiller (2006), Einmahl et al. 
(2006), de Haan et al. (2008) and Peng (2010). 


The rest of the paper is organized as follows. In Section 2, we give our main result 
which consists in a Gaussian approximation to 71 only by assuming the second-order 
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conditions of regular variation and the stability of the tail dependence function. A 
simulation study is carried out, in Section 3, to illustrate the performance of 71 . 
Section 4 is devoted to an application, as we derive the asymptotic normality of an 
excess-of-loss reinsurance premium estimator. Finally, the proofs are postponed to 
Section 5 whereas some results that are instrumental to our needs are gathered in 
the Appendix. 


2. Main results 


Weak approximations of extreme value theory based statistics are achieved in the 
second-order framework (see de Haan and Stadtmiiller, 1996). Thus, it seems quite 
natural to suppose that both df’s F and G satisfy the well-known second-order 
condition of regular variation. That is, we assume that for any x > 0 


lim 


U (. zx) 
z->°° A (z) V U (z) 
U 2 (zx) 


X 1 = X 1 


x r - 1 


r 


lim —- . 

z->ocA 2 (z) \ U 2 (z) 


X 1 


>72 I — ^72 


X 


- 1 


(2.7) 


t 2 


where U := (l/F))* - , U 2 := (l/G))^ (with E^~ (u) := inf {v : E (v) > u} , for 
0 < u < 1, denoting the quantile function pertaining to a function E), |A| and \A 2 \ 
are some regularly varying functions with negative indices (second-order parameters) 
r and r 2 respectively. 


Theorem 2.1. Assume that the second-order regular variation condition (2.7) and 
(1.5) hold. Let k := k n be a sequence of integers such that k —> 00 , k/n —> 0, 
y/kA ( n/k ) = O (1) = \fkA 2 (' n/k ). Then, there exist two standard Wiener processes 
{Wi (t ), t > 0 } , i — 1 , 2 , defined on the probability space (0, A, P) with covariance 
function R (•, •), such that 

* {wm - 1 ) - ^ (1) = MU = VE - 1 ) - ^ (1). 

and 

Vk (71 -71 )- n (k) 

r 1 (cWi ( t ) - c 2 W 2 (t)) dt - cWi (1) + c 2 W 2 (1) + Op (1), 

where c := 7 ^/ 7 , c 2 := 71/72 and 

cy/kA (n/k) c 2 yfkA 2 (n/k) 

7 (1 - r) 72 (1 - t 2 ) 



p(k) : 
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Corollary 2.1. Under the assumptions of Theorem 2.1, we have 

Vk (71 - 7i) % J\f (/i, cr 2 ) , as n — > oo, 
provided that VkA ( n/k ) —» A and VkA 2 ( n/k ) —>• A 2 , where 


0 ■ = 


H- 77 ——7 and cr 2 := 2 c 2 + 2 c 2 — 2cc 2 S, 


7 (1 - r) 72 (1 - t 2 ) 


with 


5 = 5 (R) := [ ^f dsdt- [ (R (s, 1) - R (1, s)) ds + R (1,1) 



'0 Jo J 0 

Remark 2.1. Note that cr 2 is finite. Indeed, the fact that I(x,y ) is a tail copula 
function, implies that max (x, y) <I(x,y) < x + y (see, e.g., Gudendorf and Segers, 
2010) and since R(x,y ) = x + y — £(x,y), then 0 < R (x, y) < min (x, y). It follows 
that 

1 rl R(s,t) f 1 f 1 minis,t) , 

y ’ dsdt < j j - -f^—Ldsdt = 2 , 



0 Jo 


0 J 0 


st 


»i 


/ R (s, 1) ds < -, / R (1, s) ds < - and R (1,1) < 1. 

Jo 2 J 0 2 

Therefore |5| < 4, which yields that a 2 < 00 . 

The following corollary directly leads to a practical construction of confidence inter¬ 
vals for the tail index 7 x. 

Corollary 2.2. Under the assumptions of Corollary 2.1, we have 

Vk (71 — 71) — (2 v 

-> N (0,1), as n —>■ 00 , 


where (1 = 


cA 


+ 


cr 

C 2 A 2 


and a 2 := 2 c 2 + 2c| — 2cc 2 5, with 


7 (i — f) 72 (1 - f 2 ) 

c := 7i/7, c 2 := 71 / 72 , 5 := 5 (R) , 


A := \/fcr 


-A-n— 2 fc:n 2 ^ X n — k:n 


and A 2 := \//cr 2 


n—2k:n ^ * n—k:n 


2 _ 7 ( 2 —r _ x) x n _ fc:n 2-^ (2-2 _ i) Yn _ k :n 

Here 7 and 72 are the respective Hill estimators of 7 and 72 defined in (1.3) with k' = 
k, t (resp. t 2 ) is one of the estimators ofr (resp. r 2 ) (see, e.g., Gomes and Pestana, 
2007) and R is a nonparametric estimator of R given in Peng (2010) by R(s,t) := 
k~ Y YJ!i= 1 1 i x i > x n-[ks\:n, Y % > Y n-{kt]-.n) , with [x] standing for the integer part of 
the real number x and 1 (•) for the indicator function. 
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3. Simulation study 

We carry out a simulation study to illustrate the performance of our estimator, 
through two sets of truncated and truncation data, both drawn from Burr’s model. 
We have 

F (x) — 1 — (l + x 1//<5 ) ^ 7l and G (y) — 1 — (l + y 1//<5 ) ^ 72 , x, y > 0, 

with 5 > 0 and 0 < 71 < 72 . The second-order parameters of (2.7) are r = —2y /S 
and r 2 = — 72 / 5 . The truncation probability is equal to 1 —p with p = 72 / (71 + 72 ). 
We fix p — 0.7, 0.8, 0.9 and 71 = 0.6, 0.8. The corresponding 72 —values are obtained 
by solving the latter equation. We vary the common size N of both samples and for 
each size, we generate 200 independent replicates. Our overall results are then taken 
as the empirical means of the values obtained in the 200 repetitions. To determine 
the optimal number of upper order statistics used in the computation of 71 , we apply 
the algorithm of page 137 in Reiss and Thomas (2007). 

This study consists in two parts: point estimation and 95%—confidence interval 
construction. In the first part, we evaluate the bias and the root of the mean 
squared error (rrnse) of 71 while in the second, we investigate the accuracy of the 
confidence intervals of the tail index 71 , by computing their lengths and coverage 
probabilities (denoted by ‘covpr’). The results of the first part are summarized 
in Table 3.1, whereas those of the second are given in Table 3.2, where ‘lcb’ and 
‘ucb’ respectively stand for the lower and upper confidence bounds. To compute 
confidence bounds for 7 x, with level (1 — () x 100 % where 0 < ( < 1 , from two 
realizations (xi,...,x n ) and (yi,...,y n ) of (Ad,..., X n ) and (Yf ,...,Y n ) respectively, 
we use Corollary 2.2 and proceed as follows. 

• Select the optimal sample fraction of top statistics that we denote by k*. 

• Compute the corresponding = 71 (k *), 7 2 = 72 (k *), 7 * = 7 (k *), c* = 
c(k*) and = c 2 (k*). 

• Calculate r* = r (k*) and = r 2 {k*) via one of the available numerical 
procedures (see, e.g., Gomes and Pestana, 2007) and then get A* = A (k *), 
A* = A 2 (fc*). 

• Evaluate 6 * = 5 (k*) by means of Monte Carlo integration. 

• Compute ft* — p (k*) and a* = a (k*). 


p = 0.70 


N 

n 

k 

7i 

bias 

rrnse 

n 

k 

7i 

bias 

rrnse 

500 

350 

15 

0.515 

-0.084 

0.299 

349 

15 

0.673 

-0.127 

0.356 

1000 

701 

35 

0.555 

-0.044 

0.264 

699 

32 

0.704 

-0.095 

0.307 

1500 

1049 

50 

0.554 

-0.046 

0.212 

1049 

51 

0.751 

-0.049 

0.259 

p = 0.80 

500 

400 

18 

0.521 

-0.079 

0.233 

400 

18 

0.723 

-0.077 

0.351 

1000 

801 

43 

0.566 

-0.034 

0.181 

799 

40 

0.713 

-0.087 

0.273 

1500 

1198 

64 

0.566 

-0.033 

0.145 

1200 

64 

0.752 

-0.048 

0.203 


p = 0.90 


500 

450 

22 

0.547 

-0.053 

0.186 

449 

20 

0.702 

-0.098 

0.295 

1000 

900 

45 

0.558 

-0.042 

0.148 

900 

49 

0.747 

-0.053 

0.189 

1500 

1349 

76 

0.577 

-0.023 

0.118 

1348 

77 

0.755 

-0.045 

0.151 


Table 3.1. Point estimation of the tail index based on 200 samples 
from randomly right-truncated Burr population with shape parameter 
0.6 (left panel) and 0.8 (right panel) 

At last, the (1 — () x 100 %—confidence bounds for the extreme value index 71 are 

+ A ± ’ 

where 77 2 is the (1 — £/ 2 ) —quantile of the standard normal rv. 

On the light of the results of both tables, we see that truncation is the factor that 
affects most the estimation process of the tail index. As we would have expected, the 
smaller the truncation percentage is, the better and more accurate the estimation 
is, for both index values and each sample size. The reason why we don’t consider 
small samples (we start with a size of 500) is that, in extreme-value theory based 
inference, large samples are needed in order for the results to be significant. This 
motivation becomes more obvious when, in addition, there is truncation. 

4. Application: excess-of-loss reinsurance premium estimation 

As an application of Theorem 2 . 1 , we derive the asymptotic normality of an esti¬ 
mator of the excess-of-loss reinsurance premium obtained with truncated data. Our 
choice is motivated mainly by two reasons. The first one is that reinsurance is a 
very important field of application of extreme value theory and the second is that 
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p = 0.70 


N 

lcb—ucb 

covpr 

length 

lcb—ucb 

covpr 

length 

500 

0.226 - 0.993 

0.94 

0.767 

0.294- 1.199 

0.92 

0.905 

1000 

0.319-0.851 

0.94 

0.532 

0.428 - 1.099 

0.91 

0.671 

1500 

0.396 - 0.801 

0.90 

0.405 

0.516-1.043 

0.89 

0.527 


p = 0.80 


500 

0.242 

-0.880 

0.93 

0.638 

0.313- 

1.158 

0.93 

0.845 

1000 

0.376 

- 0.790 

0.92 

0.414 

0.497- 

1.041 

0.92 

0.544 

1500 

0.436 

- 0.759 

0.91 

0.323 

0.576 - 

1.003 

0.91 

0.427 


p = 0.90 


500 

0.317 

- 0.820 

0.90 

0.503 

0.438 

- 1.085 

0.92 

0.647 

1000 

0.408 

- 0.750 

0.91 

0.342 

0.540 

- 0.998 

0.90 

0.458 

1500 

0.445 

- 0.727 

0.90 

0.282 

0.591 

- 0.965 

0.90 

0.374 


Table 3.2. Accuracy of 95%-confidence intervals for the tail index 
based on 200 samples from randomly right-trnncated Burr population 
with shape parameter 0.6 (left panel) and 0.8 (right panel) 


data sets with truncated extreme observations may very likely be encountered in in¬ 
surance. The aim of reinsurance, where emphasis lies on modelling extreme events, 
is to protect an insurance company, called ceding company, against losses caused by 
excessively large claims and/or a surprisingly high number of moderate claims. Nice 
discussions on the use of extreme value theory in the actuarial world (especially in 
the reinsurance industry) can be found, for instance, in Embrechts et al. (1997), a 
major textbook on the subject, and Beirlant et al. (2004). 


Let X!,...,X n (n > 1) be n individual claim amounts of an insured heavy-tailed 
loss X with finite mean. A Pareto-like distribution, with tail index greater than or 
equal to 1, does not have finite mean. Hence, assuming that E [X] exists necessarily 
implies that 71 < 1. In the excess-of-loss reinsurance treaty, the ceding company 
covers claims that do not exceed a (high) number u > 0, called retention level, while 
the reinsurer pays the part (X, — u) + := max (0, X.; — u) of each claim exceeding u. 
The net premium for the layer from u to infinity is defined as follows: 


F (x) dx, 


n = n(u) := E [(X — u) + ] 


•OO 
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which may be rewritten into II = wF (u) F (ux) /F (u) dx. By using the well- 
known Karamata theorem (see, for instance, Theorem B.1.5 in de Haan and Ferreira, 
2006, page 363) we have, for large u, 

fl ~ ———uF (u ), 0 < 7 i < 1. 

1 - 7i 

As we see, a semi-parametric estimator for F is needed in order to estimate the 
premium II. To this end, let us define 

C (z) := P (X < x < Y | X < Y) = P (X < x < Y ), 

with Y being the truncation rv introduced in Section 1. This quantity C is very cru¬ 
cial as it plays a prominent role is the statistical inference under random truncation. 
In other words, we have 

C (x) = p~ L F (z) G (x) = F (z) -G(x) = G(x)-F (z). 

It is worth mentioning that, since F and G are heavy-tailed then their right end¬ 
points are infinite and thus they are equal. Therefore, from Woodroofe (1985), the 
functions F, F and C are linked by 


C (x) dF (z) = F (z) dF (z), 

known as self-consistency equation (see, e.g., Strzalkowska-Kominiak and Stute, 2009), 
whose solution is 

F (z) = exp —A (z), (4.8) 

where A (z) := / t °° dF (z) /C (z) . Replacing F and C by their respective empirical 
counterparts F n (z) := n _1 1 (A,; < z) (the usual empirical df based on the 

n 

fully observed sample (Ad, ...,X n )) and C n (z) := n ” 1 1 (A,- < z <Yj) , yields the 

Z— 1 

well-known Lynden-Bell product limit estimator (Lynden-Bell, 1971) of F, 


F„ (z) = exp -A n (z), (4.9) 

where A n (z) := J)°° dF n (z) /C n (z) . If there are no ties, F„ may be put in the form 


F„W:= [I I 1 




(Xi :r 


(4.10) 


Since F is regularly varying at infinity with index — 1 / 71 , then 
F (z) ~ F (U ( n/k )) (x/U ( 1 n/k))~ 1/jl 


, as z —> cxd. 
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This leads us to derive a Weissman-type estimator (Weissman, 1978) 


F (x) = 


x 


X n —k'.n 


- 1/71 


F n {^-n—k-.n) 1 


for the distribution tail F with truncated data. Note that 

n 

F n (X n -k:n) = I 1 ~ 


i=n-k-\-1 

Thus, the distribution tail estimator is of the form 

-1/71 ( k 


nC n (X i: 


F (x) := 


x 


X n _ 


n—k:n 


i-n * 


2—1 


7iC n (d7 r) _j_|_^ :ri ) 


Consequently, we define an estimator fl n to the premium fl as follows: 

1-1/71 ( k 


ff - 71 X 1 U 

• .. ^ n—k'.n 


l-7l 


■An—k:n 


i-n 1 


z=l 


nC n (. X n -i+l-.n) 


This estimator coincides with that proposed and applied to the Norwegian fire data 
by Beirlant et al. (2001), in the non truncation case. Prior to establish the as¬ 
ymptotic normality of n n (Theorem 4.2), we give, in the following basic result, an 
asymptotic representation to the Lynden-bell estimator F„ (in XA-k-.n)- This result 
will of prime importance in the study of the limiting behaviors of many statistics 
based on truncated data exhibiting extreme values. 

Theorem 4 . 1 . Assume that the second-order conditions of regular variation (2.7) 
hold with < y 2 . Let k := k n be a sequence of integers such that k —> oo, k/n —> 0. 
Then 

7172 ^ s- 7 / 72 - 1 ^ ( s ) ds + 72 


^/ F n (X n _ fc:n ) 

V F (. X n _ k:n ) J 


(7i + 72 ) Jo 


7i + 72 


Wh(l)+o p (l) 


Consequently, 




72 

—^, as n —> 00. 
72 -7i. 


Remark 4 . 1 . Under the assumptions of Theorem f.l, we have 

Fn (X n _ fc :rl ) p 

—- > 1 , as n —> 00. 

F(X n _ k:n ) 

To establish the asymptotic normality fl n , we require the second-order regular vari¬ 
ation to F. That is, we suppose that 


lim —— [ - x~ lhl ) = x~ lhl 


t->-o o A (t) \ F (t) 


x 


1/71 _ x 


T 171 


(4.11) 
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for any x > 0 , where |A| is some regularly varying function at infinity with in¬ 
dex where T\ < 0 is the second-order parameter. For asymptotic theory 

requirements, one has to specify the relation between the retention level u and 
the quantile U ( n/k ). Indeed, as mentioned in Vandewalle and Beirlant ( 2006 ), 
amongst others, extreme value methodology typically applies to u values for which 
P(X > u) = 0(l/n), hence P(A" > u) = 0(l/n). This leads to situate u = u n 
with respect to U (n/k) so that, for large n, the quotient u/U (n/k) tends to some 
constant a. 


Theorem 4.2. Assume that the second-order regular variation conditions (4.11) 
hold with 0 < 71 < 1 and 71 <72- Let k := k n be a sequence of integers such that 
k —* 00, k/n —> 0 and y/k,A (U (n/k)) —> A* < 00. Then, whenever u/U (n/k) —> a, 
we have as n — » 00, 


\[k (n n - n 


u/U (n/k)f~ lhl U (n/k) F (U (n/k)) 


V a r 

- 4 - M 




where 


a* 2 ,= ( z a z + 


2 2 


7i72 


(71 - 1 - u ) (71 - 1 ) 

77iC^* 




*2 


+ 2- 


(72 - 7i) i 1 ~ 7i) 2 1 - 7i ’ 

with a 2 as defined in Corollary 2.1, ( := ((1 — 71) log a + 71) / (71 (1 — 71) 2 ) and 

nl R (1, t) 


5* :=- + -(r( 1,1)- [ 

772 71 V Jo 

rl R(s,i) 


t 

1 /•! 


-dt 


7i + 72 \J 0 « 7/72+1 


ds 



R (s, t) 


0 Jo 


ts 


7/'T2 + 1 


dsdt I . 


5. Proofs 


5.1. Proof of Theorem 2.1. We begin by a brief introduction on the weak approx¬ 
imation of a weighed tail copula process given in Proposition 1 of Einmahl et al. 
(2006). Set Ui := F (Xf) and Vi G (Yf) , for i — 1,... ,n , and let C (x,y) be the 
joint df of (Ui, V) ■ The copula function C and its corresponding tail R , defined in 
(1.6), are linked by C (tx, ty) — R(x,y) = O (t e ) , as t j, 0, for some e > 0, uniformly 
for x,y > 0 and rna x(x,y) < 1 (Huang, 1992). Let us define 

v n (x, y) := Vk (T n (x, y) - R n (x, y )), x,y > 0 , 

where 

1 n 

T n(x,y) 

i— 1 


k 


k 


n 


n 


n (kx ky 


Ui < -x, V< -y) and R n (x, y) := -Cl —, — 


k 




n n 
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In the sequel, we will need the following two empirical processes: 

a n ( x ) := v n (x, oo) = Vk (U n (x) - x ) and f3 n (y) := v n (oo, y) = Vk (V n (y) - y ), 

where 

TT iGl •= T (r. nrd = 

k 


1 n , ^ 

u n (a;) := T n (x, oo) = - 1 ( Uj < -x 


i— 1 


n 


and 


1 n , ^ 

Vn (y) := T n (oo, j/) = - 1 ( Vi < - 2 / 


k * —' V 


From assertions (3.8) and (3.9) in Einmahl et al. (2006), there exists a Gaussian 
process Wr (x, y) , defined on the probability space (fl, A, P), with mean zero and 


covariance 


E [W R (zi, yi) W R (x 2 , 1/2)] = .R(min (x 1 ,x 2 ) , min (y 1 , y 2 )), 
such that for any M > 0 

\v n {x,y) - W R (x,y)\ 

SU P - r _/_ ..mu -= °p l 1 ) > 


(5.12) 


and 


sup 

0 <x<M 


0 <x7y<M {max (x, y)} v 

\a n (x) - Wx (x)| 


x^ 


= Op (1) = sup 

0 <y<M 


\Pn(y)-w 2 (y) I 

yV 


(5.13) 


as n —> 00, for any 0 < rj < 1/2, where 

W] (x) := W R (. x , 00) and W 2 (y) := Wr (00, y ), 

are two standard Wiener processes such that E [Wi (x) W 2 (?/)] = R (x, y). 

To prove our result, we will write the tail index estimator -{y in terms of the processes 
OL n (•) and /?„(•). We start by splitting 71 — 71 into the sum of two terms 


72 (72 - 7 ) + 727 \ j m 

T n 1 := 7^-^77-7 (7 - 7) anci T n2 := 

(72 - 7 ) (72 - 7 ) 


7 


(72 - 7 ) (72 - 7 ) 


(72 - 72) • 


Note that, for two sequences of rv’s and Vn 2 \ we use the notation Vn^ ~ 
to say thatKi' 1 ' 1 = Vn 2 ' 1 (1 + o p (1)), as n —> 00. Since both 7 and 72 are consistent 
estimators (Mason, 1982), then, as n —» 00, we have 


T n 1 ~ - (7 - 7) and T, 


7 


n 2 


-—(72-72), 

72 


where <7 and C2 are those defined in Theorem 2.1. In other words, we have, as 
n —* 00, 

(71 - 7i) Rj -\/fc (7 - 7) — — Vk (72 - 72) 

7 72 


(5.14) 
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Next, we represent \fk (7 — 7) and \fk (72 — 72) in terms of a n (•) and f 3 n (•) respec¬ 
tively. For the first term, we use the first-order condition of regular variation of F 

( 1 . 2 ) and apply Theorem 1 . 2.2 in de Haan and Ferreira ( 2006 ) to have 

) 

t~ l F (t) dt = 7, 


lim y 

n—Hx k 


\F ■*-(!— k/n) 


n 


this allows us to write 7 = — f°° t 1 F n ( t) dt. Now, we consider the following 
decomposition 7 — 7 = S n 1 + S n 2 + S n 3, where 




n 


— fc: n 


x n 


U 1 (F n (t) — F (t)) dt, S n2 := —— 


' F-'il-k/n) 


t F (t) dt 


and 


n 


S„3 := ~ k 


F~ 1 (l—k/n) 


t 1 F{t)dt — 7. 


It is easy to verify that, almost surely, we have 


k 


F n (t) = -\J n [ r F(t)). 


n 


n- 


k ' 


( 5 . 15 ) 


Without loss of generality and after two successive changes of variables (u = tX n _k-.r. 
then s = nF (tX n -k-.n) /k), we have 

^ U n (s) - s 


Sn 1 = 


l -F{x n _ k .. n ) F (1 - sk/n) 


dF (1 — sk/n ), 


which we decompose into 

^ U n (s) - s 


S n 1 = 


q (1 — sk/n) 
/■1 


dF^ (1 — sfc/n) 


U n (s) 


-dF^ (1 — sk/n ). 


J%F(x n _ k .. n ) F (1 - sk/n) 

For the purpose of using Potter’s result of Lemma 6.3 for the quantile function 
s —> F^ (1 — s ), we write 


Sni ~ 


F^ (1 - k/n) 


+ 


! 1 F (1 — sk/n ) 
f 1 F^ (1 - k/n) 


(U n {s)-s)d 


F^~ (1 — sk/n) 

F (1 — k/n) 

F^~ (1 — sk/n) 


l£F(X n — k: n ) C 1 “ Sk / n ) 


(U n (s)-s)d 


F^ (1 — k/n) 


This allows us to write 

r l 


S n l ~ 7 W s (U n (s) — s) ds — 


10 


'%F(X n _ k;n ) 


s (U n (s) — s) ds 


In other words, we have, as n —* 00, 

VkS nl ~ 7 < / s _1 an (s) ds 


\F(X n _ kin ) 


s 1 a n (s) ds > . 


( 5 . 16 ) 
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As for the second term S n 2 , we use the mean value theorem to get 


Yl _ 

*5*712 (A n—k:n U ( Jl/k )) Z, n F (z n ) , 

where z n is a sequence of rv’s lying between X n _ k:n and U (n/k). Observe that we 
have 

A* (Zn) U (n/k) f \ n — k:n 


S n o = 


F(F^(l-k/n)) z n \U (n/k) 


1 • 


Since X n _ k . n /U (n/k) A 1, then z n /U (n/k) A 1 and — F (z n ) A 1. It follows that 

k 


5, 


n 2 


x n 


n—k:n 


l . 


U (n/k) 

Recall that C/* = F (Ah) and note that U l:n = F (A" n -i+i :n ) , therefore 

' F^~ (1 - U k+1:n ) 


S , 


1 • 


U (n/k) 

We use Potter’s bound inequalities (see Lemma 6.3) together with the mean value 
theorem to write S n2 « 7 (J^U k+1:n - lj . Since U n (j;U k+1:n ^j = 1, then 


\/~kS n 2 ~ 


(5.17) 


By summing up (5.16) and (5.17) and making use of the weak approximation (5.13) 
for a n (•), we get 


Vk(s nl + s, 


n2) 


7 < / s L W\ (s) ds — 


l :F(X n _ k;n ) 


S _ 1 ^i(s) ds - W] (Jj-U k+ i :n ) 


Next, we show that I n := Jnp^ x k ^ s l W\ (s) ds A 0. For arbitrary e, $ > 0, we 


write 


P(|/ n | >d) <P 


s~ 1 W 1 (s)ds> +p(|^F(A n _ fc:n )-l| >e). 


The fact that E | W\ (s) \ < s 1 / 2 implies that E J^_ e s l W\(s)ds < \[2 ^1 — (1 — e) 1//2 
Therefore, by Chebyshev’s inequality, we infer that 

P ^ s _ 1 IFi ( 5 ) ds>0j < V2d~ 2 (l - (1 - e) 1/2 ) . 


n— 


On the other hand, we have — F ( X n _ k:n ) A 1 this means that for all large n, 

rC 

(|^F(X n _ fc:n ) - l| > e) < A2e- 2 (l - (1 - e) 1/2 ) . 
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It follows that P (\I n \ > if) < y/2 (tC 2 + e -2 ) ^1 — (1 — e) 1 ^ which tends to zero 
when e,d j. 0, as sought. Since nUk+i :n /k A 1, then by using similar arguments as 
the above we have W i (■ nUk+i :n /k) = W\ ( 1 ) + o p ( 1 ). Consequently, we have 

Vk (S n i + S n2 ) = (1 + o p (1)) 7 [ s _1 IPi (s) ds - 7W1 (1). 

J o 

For the third term, it suffices to use the second-order condition of regular variation 
to obtain ^ 

XkS n3 = hMhZh(i + o( i)) 


In summary, we have 

Vk (7 - 7) = 7 [ s~ 1 Wi (s) ds - 7W1 (1) + ( n /k) _)_ 0p . 


r 


(5.18) 


^2 _ 

Likewise, we write 72 = — J^° ^ t 1 G n (t) dt , where G n (x) := n 1 EIU 1 (E < x) 
is the usual empirical df based on the fully observed sample (Yi,..., Y n ). Then, by 
using similar arguments, we express 72 in terms of the process f3 n (•) as follows: 


Vk (72 - 72) 


72 { f s 1 /3 n (s)ds- [ _ 

'0 J%G(Y n _ k:n ) 


S ftn (®) ds (J n Lfc+l:n^ 


Then by using approximation (5.13) for /3 n (•), we obtain 

Vk (72 - 72) = 72 [ S _1 IP 2 (s) ds - 72IF2 (1) + ^/ fc ) + Op (1). (5.19) 

Finally, substituting results (5.18) and (5.19) in equation (5.14) achieves the proof.□ 
5.2. Proof of Corollary 2.1. Elementary calculations, using the covariance for- 

j -12 

mula (5.12) and the fact that E f Q s^Wi (s) ds = 2, i = 1,2, straightforwardly 
lead to the result. 


□ 


5.3. Proof of Corollary 2.2. It suffices to plug the estimate of each parameter in 

the result of Corollary 2.1. To estimate the limits A and A2, we exploit the second- 

order conditions of regular variation (2.7) . We have, as z —» 00, 

a / \ U (zx) /U (z) - x 7 r 

A z ~ T -————-, for any x > 0. 

x 7 (x T — 1) 

In particular, for x = 1/2, and 0 = n/k , we have 


A (n/k) ~ r 


2-7 (2" r - 1) 
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Hence, we take 

'T / /j \ ^X n _2k:n\/X n —k:n 2 _ ^X n —2k:n 2 7 X n _ k:n 

[n/ ) = T 2-7 (2-^-1) =r 2-7(2---l)X n _fc:„’ 

an estimate of A (n/k ). Thus, the expression of A readily follows. The same idea 
applies to A 2 as well. □ 

5.4. Proof of Theorem 4.1. For convenience we set 

j-j* _F n (X n _ k:n ) F (A n _ k:n) 

n '~ F(X n _fc:n) ' 

- P 

Since F is regularly varying at infinity with index —1/71 and X n _ k:n /U ( n/k ) —> 1, 
then F (X n _ k:n ) « F (U (n/k)) and therefore 

j-j* ^ (X n _ k - n ) F (X n _ k:n ) 

n F (U(n/k)) ' 

Using equations (4.8) and (4.9) , we have 

j-j* ^ {1 exp A n (X n _ k:n )} {1 exp A (X n — k:n )y 

n 1 — exp — A (U (n/k)) 

Since both F n (X n _ k:n ) and F (X n _ k:n ) tend to zero in probability, then A n (X n _ k:n ) 
and A (X n _ k . n ) go to zero in probability as well. Hence, by using the approximation 
1 — exp(— x) ~ x, as x —* 0, we get 

A n (X n — k:n ) A (X n — k:n ) 


d 


= : D, 


A (U (n/k)) 

Now, we study the asymptotic behavior of D n . The numerator 


A-n (X n — k:n ) A ( X n _ k:n ) 


dF n (z) + r 


ix. 


n — k:n 


dF(z) 

Cn (z) ' J Xn _, n C (z) 


may be decomposed into SA1+ S n 2 + S n 3, with 
5 d(F n (z)-F(z)) 


S n 1 := - 


' U (n/k) 


■, S n 2 — 


and 


C{z) ’ J Xn _,ACu(z) C(z) 

_ [”<■»/*) d(F n (z)-F(z)) 

k>n3 ~ I - 


dF n (z) , 


X . C(z) 

We will show that \fkS n \/A(U (n/k)) is an asymptotically centred Gaussian rv 
while both \fkSra/A (U (n/k)) and VkS n 3/A (U (n/k)) tend to zero (in probability) 
as n —* 00. An integration by parts yields that S n 1 = , where 

(i) _ F n (U (n/k)) - k/n 
nl ' C(U(n/k)) 
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and (with a change of variables) 


c( 2 ) ._ 

°nl - — 


F n (zU ( n/k))-F(zU (■ n/k )) 


It is easy to verify that 


C 2 (zU(n/k)) 
k/n 


dC (zU (n/k)). 


~o. n ( 1 ), 


A (U (n/k)) K(U (n/k))C(U (n/k)) 
where ot n (-) is the uniform tail empirical process defined at the beginning of the 
proof of Theorem 2.1. From Lemma (6.2) , we infer that 

VksS/A (U (n/k)) « 77f 1 a n (1). (5.20) 

:( 2 ) 


For the term we have 


Jl C 2 (zU(n/k)) n \k { [ ' ’w C(U(n/k)) 

A (U (n/k)) ~ (n/k)A(U(n/k))C(U(n/k)) 

From Lemma (6.1), we know that the function C' is regularly varying at infinity with 
index — 1 /72, then by using Potter’s inequality, together with (6.26), we get 

Vks% 


A (U (n/k)) 


(71 +72) 1 / ^ 1/72 1 a n (j;F (zU (n/k )) j dz, 


Jl Jl 

which, by the change of variables s = —F (zU (n/k)) = —F ( zF (1 — k/n )), be- 

rC rv 


comes 


Vks$ 


(71+72) 1 / (^nis)) 1 ' 12 1 a n (s) dl/n (s) , 


A (U (n/k)) 


1/72—1 


where i/> n (s) := F^ (1 — ks/n) /F^ (1 — k/n ). Making use, once again, of Potter’s 
inequality of Lemma 6.3 to the quantile function s —> F (1 — s ), yields 


VksS 


7 l 72 


,-7/72-1 


a n (s) ds. 


A (U (n/k)) (71 + 72) Jo 

Subtracting (5.21) from (5.20) and using the weak approximation (5.13), we get 


(5.21) 


VkS , 


nl 


7 i 72 


s -7/72-l Wl ( s ) ds + l_ Wl (!) + 0p (!) . 


A (U (n/k)) (71+72) do 


7i 


Note that the centred rv s 7 + 2 1 W\ (s) ds has a finite second moment (in fact it 
is equal to 2y|/ ((y 2 — 7) (72 — 2 q)). As a result, the approximation above becomes 

VkS n 1 


7 i 72 


s -l/l2-l Wi ( 5 ) + ( 1 ) + o ( 1 ) . 

7i 


A (U (n/k)) ( 7l + 72 ) 2 y 0 

Now, we consider the second term S n 2. Since F n (z) =0, for z > X n:n , then 

/ ‘* n:n C n (z)-C(z) 


S n 2 = 


IX, 


n—k:n 


C n (z)C(z) 


-dF n (z). 
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It follows that 


15^2 I < Or 


IX, 


n—K:n 


\C n (z)-C(z)\ 
C 2 (z) 


dF n (z ), 


where 0 n := sup Xl . n<z<Xn . n {C (z)/C n (z)} , which is stochastically bounded (see, 
e.g., Stute and Wang, 2008). We have C = G — F and C n = G n — F n , then 
| S n 2 1 < 0 n (T n 1 + T n2 ) , where 


T n l ■ — 


'X„ 


\F n (z)-F(z)\ 
C 2 (z) 


dFn (z) and T n2 := 


Xn. — k 


I G n (z)-G(z) | 
C 2 (z) 


dF n (z) • 


The set A n , e '■= {\X n -k:n/U ( n/k ) — 1 | > e} , 0 < e < 1 is such that P (T,, i£ ) — > 0 
as n — > oo. For convenience, let u n ,e (1 — e)U (n/k) and 

1 |F„ (*)-?(*) | 


T„i e) := 


It is obvious that, for d > 0, 

VkT n i 
A (U (n/k)) 


> d < P 


C 2 (z) 

VkT nl (e) 
A (U (n/k)) 


dF n (z). 


> d + P (A n ) • 


Then it remains to show that P 
let us write 

VkTn 1 (e) 


VkT n i (e) 
A (U (n/k)) 


> d ) —> 0 as n —> oo. To this end, 


k/n 


VkT nl (e) \F(U(n/k))\ 

A (U (n/k)) [C(U (n/k)) j \ A (U (n/k)) C (U (n/k)) 

C(U (n/k))\ 2 r°° 1 a n {nF (zu n , e ) /k) \ F n (zu n>£ ) 
C(u n ,e.) J Ji [C(zu n , e )/C(u nie )f F(U(n/k)) 


->■ 


The regular variation property of C, that implies that C (U (n/k)) /C (u 71y 
(1 — e) 1 ^ 72 , as n —> oo , together with (5.13), (6.26) and Potter’s inequality (see 
Lemma 6.3), give 


VkTni ( 6 ) . F (U (n/k)) 7 /"“ F„(zu n ,,) 

A (U (n/k)) >C (U(n/k)) 7l J, ~ F(U(n/k))' 

The expectation of the integral in the previous equation equals 

/ OO 

z^d(F(zu n ,)/F(U(n/k))), 

which, by routine manipulations and the fact that the parameters 71 and 72 are 
such that 71 < 72 , converges to — (1 — e ) -1 ^ 7 72 / (27 — 72 ) as n —» 00 . On the 
other hand, we have F (U (n/k)) = k/n and (k/n) /C (U (n/k)) — » 0 as n —» 00 
(from Lemma 6.1). Therefore, VkT n \ (e) /A (U (n/k)) A 0 as n —» 00 and so does 
VkT n xj A (f/ (n/k)). Similar arguments lead to the same result for VkT n2 / A (U (n/k)) , 
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therefore we omit details. Finally, we focus on the third term S n 3 , for which an in¬ 
tegration by parts yields 

_ f v '" ,k) F„(z)-F(z) 

S„ s -/ dCiz) 

F n (X n _ k:n ) - F (X n _ k:n ) ~F n (U (: n/k)) —~F{U (n/k)) 

C(X n _ k:n ) C (U (n/k)) 

Changing variables and using the process a n (•), we get 
VkS n 3 k/n 


MU (n/k)) C (U (n/k)) A (U (n/k)) 


x 


n 


f 1 a n y-F(zU (n/k))j jC(zU(n/k)) 

lx n . k:n /u(n/k) (C (zU (n/k)) /C (U (n/k ))) 2 { C (U (n/k)) 

, C (U (n/k)) fn- 


C (X n _ k:n ) 


^ -C (X n _ k:n )^ OL n (1) 


For convenience, we set 


:= m&x(X n _ k . n /U (n/k) , 1) and := min (X n _ k , n /U (n/k ), 1 ). 

By using routine manipulations, including Potter’s inequality (see Lemma 6.3) and 
the fact that sup 0<t<1 a n (t) is stochastically bounded, we show that 


VkS, 


n3 


7 


Op (1) (£ - C) + 


C (U (n/k)) ( n- 


^ X (X n _ k:n )'^ OL n (1) 


A (U (n/k)) 7l I PWV " nJ C (X n _ k:n ) 

Since £+ - = |1 - X n _ k:n /U (n/k) | and X n _ k:n /U (n/k) A 1, then £+ - C °- 

Now, it is clear that 

VkS n3 7 (C(U(n/k)) 


* (tt ( // \ \ 1 r< ( V \ \ ^ i^n-kin) ) a n ( 1 ) 

A (U (n/k)) 7 i [C(X n _ k;n ) V \k 


n- 


C (U (n/k)) 
C (X n _ k:n ) 


a n ( 1 ) | + O p ( 1 ) . 


We have C (U (n/k)) /C (X n _ k:n ) A 1 and a n (1) = O p (1), then it suffices to show 
that a n F (X n _ k:n )^J — a n (1) A 0. Indeed, making use of the approximation 
(5.13), we get 

«n (%F (.X n _ k:n )) - a n (1) = W, Qf (X n _ k:n )^j - Wi (1) + o p (1). 

Since { W\ (t) , 0 < t < 1} is a Wiener process, then it is easy to verify that 
Wi F(X n _ k:n )) - Wi (1)| = |Wi (| ^F(X n _ k:n ) - l|) 
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Recall that — F (X n _ k . n ) A 1, then by using similar arguments as those used in the 
k 

/ Jl _ 

proof of Lemma 5.2 (i) in Brahim et ah (2014), we show that W\ I — F ( X n _ k:n ) ~ 1 

V K 

tends to zero in probability, which implies that VkS n s /A (U ( n/k )) A 0 as well. In 
summary, we showed that 


VkF>* = 


7i72 


s~ 7 / 72 - 1 ^ ( s ) ds + J_ Wi (!) + 0p (!) ; 


(7i + 72 ) Jo 


7i 


which leads to the wanted result. Finally, with some elementary calculations, we get 
the variance of the Gaussian variable \/k (F n /F (X n _ k:n ) — l) and conclude 

the proof. □ 

5.5. Proof of Theorem 4.2. For the sake of notational simplicity, we set £ = £ n : = 
U (n/k). Let us rewrite II into 


II = uF ( u ) 

and consider the decomposition 


1,00 F (ux) , 

—- ax, 

F (u) 




n n -n 


where 


Snl ■ = 

S n 2 

Sn 3 
SnA 

Snb '■ = 


(u/£f~ lhl (F 

(u/X^) 1 - 1 ^ 


(u/£) 


1-1/71 


- 1 


2—1 

7l -'k n—k:n F n (X n — k:n ) 
l-7i * 


X-n—k\n Fn (-^-n—k-.n) f 7l 


F (£) 


7i 


I-71 I-71 


7l ^ (X-n—k-.n) F n (\ n — k:n ) f X n — k - 


I -71 F(f) F(X n _ k:n ) 

7l F n (X n —k:n) J F (X n _ k:n ) 


1 - 7i F (. X n _ k:n ) 


F(£) 


X n _ 


n—k:n 


-I/71 


7l (-^n—fc:n) I f X n —k\ 


1 - 7i F (X n _ fc:n ) 


-1/71 


Sn6 — 


7i f Fn (X n _ k:n ) 


1 - 71 If (x n _ k:r 


-1 


Sn7 ■ = 


7i 


l-7i 


M\l/7 1 F(u) fF(e) 


7 


=- -dx. 


F (£) J ! F (n) 

We start by representing the five quantities \fkS ni , i — 1,2, 3, 5, 6 in terms of the 
Gaussian processes IF] and IL 2 , given in Theorem 2.1, then we show that VkS n 4 
and \/kS n 7 converge to deterministic limits. For the first term S n 1 , recall that 
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X n -k:n ~ £, which implies by the regular variation of F that F (X n _ k :n) &¥(£). On 
the other hand, we have 71 —> 71 and, from Remark 4 . 1 , F n (. X n _ k:n ) « F (X n _ k:n ). 
It follows that 


Sm 


7i 

I-71 


(u/£) 


1-1/71 


-n = 


T 1 \ 

c(l) 1 c(2) 

°nl “r °nl 

1 - 71 1 

1/71-1/71 

f( £/X n _ k:n 


By using the mean value theorem in , we have 

Sni = (1/71 - 1/71) (u/£Y n log (u/£) , 

with e n being between I/71 — I/71 and 0 . The consistency of 71 implies that e n —> 0 , 
and therefore S, r ^ fa 7 )“ 2 (7! — 7^ log (u/£). Likewise, we may readily show that 

1 7l (X-n—k:n 


o( 2 ) ^ 
°nl “ 


7l 


Consequently, 


5 , 


log (u/£) 


n 1 


n \ (7i - 7i) + 
7i (1 - 7i) 

By using similar arguments we also show that 


X 


n—k:n 


- 1 


5 


7i - 7i 


n2 


, S, 


7i 


,2> n3 

(l-7i) 

Summing these four terms, we obtain 

c , c , c , c _ i 1 - 7i) lo S ( u /£) + 71/- \ 

7i (1 - 7l) 

Now, we use the second approximation in Theorem 2.1 to have 

nr, Q . Q . Q , c 'i ~ (1 “7i)l°g a + 7i 

V k (S n i + S n 2 + S n 3 + S n < 5) -—-~2- 

7i (1 - 7i) 

x {£1- 1 {cW x it) - c 2 W 2 00) dt - cWi (1) + c 2 1F 2 (1) +fi(k) + Op 


X„_ 


n—k:n 


1 I and S, 


n5 


7i 


X„_ 


n—k:n 


( 5 . 22 ) 


The asymptotic representation of Theorem 4.1 yields 

r 1 


VkS t 


n6 


7i 


7i72 


s- 7/72-1^ ( s ) ds + l_ Wl (!) L + 0p (1). (5.23) 

7i 


1 “ 7i l (71 + 72) 2 Jo 

For the fourth term SO4, it suffices to use the second-order condition of regular 
variation ( 4 . 11 ) and the fact that X n _ k . n a £ } to get 

VkS n4: = o P (VkX (£)^ = o P (1), as n —» 00. ( 5 - 24 ) 


For the last term S n j, we first note that 


Sn7 = 


x l X x dx — 


uF (u) 


(u/£) 


1-1/71 


£F 1 


F (ux) , 
—— -dx, 

F(u) 


OO 




















23 


In addition to the the regular variation of |A| , we apply the uniform inequality of 
regularly varying functions (see, e.g., Theorem 2.3.9 in de Haan and Ferreira, 2006, 
page 48) to show that 


VkS, 


VkA< 


n7 


(ii -1 - n) (71 - l)' 


(5.25) 


Finally, gathering results (5.22) , (5.23) , (5.24) and (5.25) yields a Gaussian ap¬ 
proximation from which we derive the normal limiting distribution of the premium 
estimator Yl n . Tedious computations for the asymptotic variance complete the proof 
of the theorem. □ 


Concluding notes 

We proposed an estimator of the tail index for randomly truncated heavy-tailed data 
based on the same number of extreme observations from both truncated and trunca¬ 
tion variables. Thus, the determination of the optimal sample fraction becomes stan¬ 
dard, in the sense of applying any convenient algorithm available in the literature. 
The asymptotic normality of the estimator is established by taking into account the 
dependence structure of the observations and a practical way to construct confidence 
bounds for the extreme value index is given. The obtained Gaussian approximations 
are of great usefulness as they allow to determine the limiting distributions of several 
statistics related to the extreme value index such that high quantiles and risk mea¬ 
sures estimators (see, for instance, Necir and Meraghni, 2009). As an application, 
we provided an estimator for the excess-of-loss reinsurance premium in the case of 
large randomly truncated claims. 
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6. Appendix 

Lemma 6.1. Assume that the second-order conditions (2.7) hold with 7 ! < y 2 . Then 
the function C is regularly varying at infinity with index — 1/72 and t _1 C (U (f)) —> 0 
as t —> 00. 

Proof. We have C = G — F with 71 < 72, hence C (x) ~ G (x) as x —> 00. Since 
both F and G satisfy the second-order conditions (2.7), then in view of Lemma 3 in 
Hua and Joe (2011), there exist two constants 6,62 > 0, such that F (x) ~ Sx ~ 1 /' 7 
and G (x) ~ ^x^ 1 ^ 72 , as x —* 00. The first equivalence implies that U (t) ~ <5 7 t 7 , as 
t — > 00 , therefore C (U (f)) ~ J 2 jA^pd 72 , it follows that 

t~ l C (U (t)) ~ 5 2 5 lh2 f ,h2 - 1 . 

By assumption, we have 71 < 72, it follows that 7/72 = 71/ (71 + 72) is less to 1 / 2 , 
thus (U (t)) —> 0 as t —> 00, which achieves the proof of the lemma. □ 


Lemma 6.2. Under the assumptions of Lemma 6.1, we have 

tA (U (t)) C (U (t)) —» 71/7 as t —> 00 . 


(6.26) 


26 


Proof. Recalling that A (x) = J'(° dF (z) jC (z) and F (U (t)) — t 1 , we write 

" h C (zU (£)) F (£/(«)) 

Making use of Potter’s inequality for both C and F, we infer that, 


tA(U(t))C(U(t)) 


z l ^ l2 dz = 71 / 7 , as t —* 00 , 


as sought. 


□ 


Lemma 6.3. Suppose that tp is a regularly varying function (at infinity) with index 
p e 1, i.e. p (' tx) /<p(t) x p , as t —* 00 , for all x > 0. Then for any 0 < e < 1, 
there exists t 0 = t 0 (e) such that for t > to, tx > to, 

(T) ( j 

(1 — e) x g min (x e , x~ e ) < —< (1 + e) x p max (x e , x~ e ) . 

v <p (t) v ' 

In other words, we have, for every xq > 0, 

p ( tx ) 


lim sup 


t —>00 


X>XQ 


<P(t) 


— x p 


= 0 . 


Proof. This result, known as Potter’s bound inequalities, is stated in, for instance, 
de Haan and Ferreira, 2006, Proposition B.1.9, Assertion 5, page 367. □ 







